Evolution of protein sequences and structures.

Authors

  • T C Wood
  • W R Pearson
Abstract

The relationship between sequence similarity and structural similarity has been examined in 36 protein families with five or more diverse members whose structures are known. The structural similarity within a family (as determined with the DALI structure comparison program) is linearly related to sequence similarity (as determined by a Smith-Waterman search of the protein sequences in the structure database). The correlation between structural similarity and sequence similarity is very high; 18 of the 36 families had linear correlation coefficients r ≥ 0.878, and only nine had correlation coefficients r ≤ 0.815. Inclusion of higher-order terms in the structure/sequence relationship improved the fit by less than 7% in 27 of the 36 families. Differences in sequence/structure correlations are distributed evenly among the four protein structural classes, alpha, beta, alpha/beta, and alpha+beta. While most protein families show high correlations between sequence similarity and structural similarity, the amount of structural change per sequence change, i.e. the structural mutation sensitivity, varies almost fourfold. Protein families with high and low structural mutation sensitivity are distributed evenly among protein structure classes. In addition, we did not detect strong correlations between structural mutation sensitivity and either protein family mutation rates or protein size. Our results are more consistent with models of protein structure that encode a protein family's fold throughout the protein sequence, and not just in a few critical residues.
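As a rough illustration of the kind of per-family analysis the abstract describes, the sketch below computes a linear correlation coefficient and a least-squares slope between sequence-similarity and structural-similarity scores. The score values are made up for illustration and are not taken from the paper. Reading the slope as the "structural mutation sensitivity" is an assumption about how that quantity might be estimated; the abstract does not state the exact procedure.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Linear (Pearson) correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def least_squares_line(xs, ys):
    """Slope and intercept of the least-squares line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# HYPOTHETICAL scores for the members of one protein family (not paper data):
# x = sequence similarity (e.g. a Smith-Waterman score),
# y = structural similarity (e.g. a DALI score).
seq_scores = [120.0, 250.0, 410.0, 560.0, 900.0]
struct_scores = [8.1, 12.4, 17.9, 22.0, 33.5]

r = pearson_r(seq_scores, struct_scores)
slope, intercept = least_squares_line(seq_scores, struct_scores)
print(f"r = {r:.3f}, slope = {slope:.4f}, intercept = {intercept:.2f}")
```

Under this reading, a family with a steeper slope would show more structural change per unit of sequence change, and comparing slopes across families would give the kind of spread the abstract reports.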

Similar resources

Structural Characteristics of Stable Folding Intermediates of Yeast Iso-1-Cytochrome-c

Cytochrome-c (cyt-c) is an electron transport protein that has been present throughout evolution. More than 280 sequences have been reported in the protein sequence database (www.uniprot.org). Though diverse in sequence, cyt-c has essentially retained its tertiary structure or fold. Thus a vast data set of varied sequences with retention of similar structure and fun...

(Short article) Phylogenetic analysis and molecular evolution of leptin

In the current study, the phylogenetic analysis and molecular evolution of mammalian Leptin were investigated. Data were obtained and aligned by searching the genome database, and all examined mammals contained only a single copy of Leptin. The nucleotide substitution rate of the sequences and the molecular evolution of Leptin were calculated by maximum likelihood and neighbor-joinin...

Computational Identification of Micro RNAs and Their Transcript Target(s) in Field Mustard (Brassica rapa L.)

Background: Micro RNAs (miRNAs) are a pivotal part of non-protein-coding endogenous small RNA molecules that regulate the genes involved in plant growth and development, and respond to biotic and abiotic environmental stresses posttranscriptionally. Objective: In the present study, we report the results of a systemic search for identification of new miRNAs in B. rapa using homology-based ...

Comparative Phylogenetic Perspectives on the Evolutionary Relationships in the Brine Shrimp Artemia Leach, 1819 (Crustacea: Anostraca) Based on Secondary Structure of ITS1 Gene

This is the first study on phylogenetic relationships in the genus Artemia Leach, 1819 using the pattern and sequence of secondary structures of internal transcribed spacer 1 (ITS1). Significant intraspecific variation in the secondary structure of ITS1 rRNA was found in Artemia tibetiana. In the phylogenetic tree based on joined primary and secondary structure sequences, Artemia urmiana and pa...

Bioinformatics study of complete amino acid sequences of neuraminidase (NA) antigen of H1N1 influenza viruses from 2006 to 2013 in Iran

Introduction: Influenza is a contagious acute viral disease of the respiratory tract that causes fever, headache, muscle aches and cough. One of the unique features of influenza virus is antigenic variation in viral protein neuraminidase (NA) which causes emergence of new virus variants. NA is responsible for the release and spread of progeny virions. Due to the continuous changes of NA genes, ...

Characterization of the Full Length Coat Protein Gene of Iranian Grapevine fanleaf virus isolates, genetic variation and phylogenetic analysis

The full-length coat protein gene of Grapevine fanleaf virus (GFLV) isolates from Iran was characterized by reverse transcription polymerase chain reaction (RT-PCR) and sequencing. The expected 1515 bp coat protein (CP) gene amplicon was obtained for 16 isolates out of 89 that were identified by double antibody sandwich enzyme-linked immunosorbent assay (DAS-ELISA) in a population ...

Journal title:
  • Journal of molecular biology

Volume 291, Issue 4

Pages -

Publication date 1999